Learning Hadoop 2 by Garry Turkington & Gabriele Modena
Author:Garry Turkington & Gabriele Modena [Turkington, Garry]
Language: eng
Format: epub, mobi
Publisher: Packt Publishing
Published: 2015-02-12T22:00:00+00:00
Now, it is just a matter of grouping hourly_tweets by hour and then generating a count of tweets per group, as follows:
hourly_tweets_count = FOREACH (GROUP hourly_tweets BY hour) { GENERATE FLATTEN(group), COUNT(hourly_tweets); }
Sessions
DataFu's Sessionize class can help us to better capture user activity over time. A session represents the activity of a user within a given period of time. For instance, we can look at each user's tweet stream at intervals of 15 minutes and measure these sessions to determine both network volumes as well as user activity:
DEFINE Sessionize datafu.pig.sessions.Sessionize('15m'); users_activity = FOREACH tweets { GENERATE CustomFormatToISO($0#'created_at', 'EEE MMMM d HH:mm:ss Z y') AS dt, (chararray)$0#'user'#'id' as user_id; } users_activity_sessionized = FOREACH (GROUP users_activity BY user_id) { ordered = ORDER users_activity BY dt; GENERATE FLATTEN(Sessionize(ordered)) AS (dt, user_id, session_id); }
Download
Learning Hadoop 2 by Garry Turkington & Gabriele Modena.mobi
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Test-Driven iOS Development with Swift 4 by Dominik Hauser(7772)
Filmora Efficient Editing by Alexander Zacharias(5854)
The Infinite Retina by Robert Scoble Irena Cronin(5329)
Learn Wireshark - Fundamentals of Wireshark. by Lisa Bock(4014)
Linux Device Driver Development Cookbook by Rodolfo Giometti(3939)
Edit Like a Pro with iMovie by Regit(3463)
Linux Administration Best Practices by Scott Alan Miller(2863)
Linux Command Line and Shell Scripting Techniques by Vedran Dakic & Jasmin Redzepagic(2839)
MCSA Windows Server 2016 Study Guide: Exam 70-740 by William Panek(2528)
Mastering PowerShell Scripting - Fourth Edition by Chris Dent(2435)
Docker on Windows by Stoneman Elton(2323)
Kali Linux - An Ethical Hacker's Cookbook: End-to-end penetration testing solutions by Sharma Himanshu(2317)
Creative Projects for Rust Programmers by Carlo Milanesi(2281)
Hands-On AWS Penetration Testing with Kali Linux by Karl Gilbert(2115)
Hands-On Linux for Architects by Denis Salamanca(2056)
Programming in C (4th Edition) (Developer's Library) by Stephen G. Kochan(2011)
Computers For Seniors For Dummies by Nancy C. Muir(2009)
The Old New Thing by Raymond Chen(1942)
Linux Kernel Debugging by Kaiwan N Billimoria(1768)
